arXiv:1504.07897vl [physics.bio-ph] 29 Apr 2015 


Conditions for positioning of nucleosomes on DNA 


Michael Sheinman, Ho-Ryun Chung 
Max Planck Institute for Molecular Genetics, 14195 Berlin, Germany 
(Dated: April 30, 2015) 

Positioning of nucleosomes along eukaryotic genomes plays an important role in their organization 
and regulation. There are many different factors affecting the location of nucleosomes. Some can be 
viewed as preferential binding of a single nucleosome to different locations along the DNA and some 
as interactions between neighboring nucleosomes. In this study we analyzed how well nucleosomes 
are positioned along the DNA as a function of strength of the preferential binding, correlation length 
of the binding energy landscape, interactions between neighboring nucleosomes and others relevant 
system properties. We analyze different scenarios: designed energy landscapes and generically dis¬ 
ordered ones and derive conditions for good positioning. Using analytic and numerical approaches 
we find that, even if the binding preferences are very weak, synergistic interplay between the in¬ 
teractions and the binding preferences is essential for a good positioning of nucleosomes, especially 
on correlated energy landscapes. Analyzing empirical energy landscape, we discuss relevance of our 
theoretical results to positioning of nucleosomes on DNA in vivo. 

PACS numbers: 


I. INTRODUCTION 

Our genome is packed and organized by nucleosomes— 
histone octamers wrapped around by 147 bp of DNA [TJ 
|5]. Nucleosomes are in some cases very well positioned 
while in others they are rather "smeared" along the DNA 
molecule mil]. Their positioning properties are known to 
be important in the regulation of gene expression EHT]. 
There are many factors which determine positioning of 
nucleosomes along the DNA and their relative influence 
is a matter of active debate in the held (see Ref. j8] for a 
review). 

The most discussed positioning mechanism is the DNA 
sequence heterogeneity. It is well known that to wrap 
DNA around a nucleosome one needs different energies 
for different DNA sequences mm- In this case the de¬ 
bate is only about the importance of DNA sequence pref¬ 
erences, relative to other factors. 

An obvious competitor of sequences preferences for nu¬ 
cleosomes positioning is thermal fluctuations. All mea¬ 
sured binding energy differences between different se¬ 
quences do not exceed a few feeT even for specially de¬ 
signed strongest binders, which do not exist in known 
genomes [iiHin]. This indicates that, at least in equilib¬ 
rium, entropic forces are expected to play an important 
role. It is not entirely clear whether the nucleosomes 
reach (quasi)equilibrium and how they do it. It was sug¬ 
gested that some active chromatin remodeling enzymes, 
facilitate the equilibration of the nucleosomes by increas¬ 
ing the off-rate of the nucleosomes from the DNA m- 
These and others active chromatin remodeling enzymes 
HZIIIH] and DNA-binding proteins m also affect nucle¬ 
osomes positioning by actively moving the nucleosomes 
and by DNA binding competition. 

In addition to external positioning signals, there are ar¬ 
guments for and evidences of interactions between neigh¬ 
boring nucleosomes along the DNA |2ni[2^ . The interac¬ 
tions are also expected to affect nucleosome positioning. 


This positioning factor is different from the one men¬ 
tioned above, since it depends not on an absolute position 
of a nucleosome on the DNA, but on a relative position of 
two nucleosomes—the distance between two neighboring 
nucleosomes. 

In this study for simplicity we divide the positioning 
factors to two types. The first type includes all the fac¬ 
tors which determine the position of a single nucleosome. 
One can characterize it by an effective binding energy 
landscape of a nucleosome along the DNA molecule that 
depends only on the location of the nucleosome along 
the DNA. The second type corresponds to interactions 
between neighboring nucleosomes. We characterize this 
positioning factor by an effective interaction potential 
that depends only on the distance between two neigh¬ 
boring nucleosomes. We assume that nucleosomes can¬ 
not invade each others DNA territories (although it is 
not entirely true |30It5^ this is not expected to affect 
significantly the conclusions of this study). In addition 
we analyze only the equilibrium distribution of nucleo¬ 
somes, ignoring non-equilibrium aspects. 

In general, we address the following question: within 
the framework of the above assumptions, what should be 
the properties of the effective energy landscape and ef¬ 
fective interaction potential between neighboring nucle¬ 
osomes to achieve good positioning? We analyze energy 
landscapes with different properties and different interac¬ 
tion potentials and derive conditions leading to good po¬ 
sitioning of nucleosomes. We show that the interactions 
between nucleosomes can significantly improve their po¬ 
sitioning even on almost flat and highly correlated energy 
landscapes. In this case, if the positioning is good, one 
expects to observe also large length-scale fluctuations of 
nucleosome occupancy along the DNA. Comparing our 
results to empirical study, we And good qualitative and 
quantitative agreement. 

Before we start with detailed description of the model 
and its analysis, it might be instructive to illustrate the 
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main message of the paper with a toy example. Con¬ 
sider non-correlated Gaussian binding energy landscape 
with standard deviation of l-SfeeT, on a circular DNA of 
length 400bp (see Fig. [^a)). Ten non-interacting "nucle- 
osomes" of size Ibp cannot be well positioned with such a 
weak energetic disorder (see Fig. Be)). However, adding 
very strong interaction between the nucleosomes, such 
that the distance between them is restricted to 15bp, one 
get good positioning on the same, weak energetic profile 
(see Fig. B®))- Autocorrelation of an energy (see Fig. 
[^b)) makes the positioning even more problematic (see 
Fig. Bb))- However, again, strong interactions between 
the nucleosomes improves it to a reasonable level (see Fig. 
Bd)). In the paper we will derive, within quite general 
set of assumptions, conditions for positioning on uncor¬ 
related and correlated correlated binding energy land¬ 
scapes. In Figs. B®) ^’^d (f) one can see that, when 
interactions between the nucleosomes are exploited for 
a better positioning, there are long length-scale fluctu¬ 
ations of the nucleosomes occupancy along the DNA— 
there are long enriched and long depleted regions. Below 
we analyze the properties of such regions and demon¬ 
strate relevance of this effect to empirical data. 

The structure of the paper is as following. In Section 
El we formulate the model we use. In Section IHII we 
define the quantities we use to characterize positioning 
of nucleosomes on the DNA. In Section |IV| we analyze 
positioning of a single nucleosome. The purpose of this 
Section is not only didactic, because we use its results 
below. In Section we analyze positioning of a many 
nucleosomes with only hard-core interactions. The pur¬ 
pose of this Section is to contrast it to the case with in¬ 
teractions between neighboring nucleosomes and demon¬ 
strate the importance of these interaction in Section [vg 
We generalize our results for energy landscapes with au¬ 
tocorrelation in Section IVHI In Section IVHII we discuss 
relevance of our conclusions to real systems and compare 
to empirical results. After discussion about tunability 
and robustness of positioning issues, emerging from our 
results, in Section |IX| we summarize in Section We 
proceed now with a detailed description of our model. 


II. THE MODEL 

We analyze the following lattice based model, which is 
often used to calculate occupancy of DNA-binding pro¬ 
teins |33]. In a grand-canonical ensemble on average N 
nucleosomes are located on a linear DNA of length L, 
in units of bp with reflecting boundaries. Note, that 
near the saturation of the DNA by nucleosomes the 
grand-canonical and canonical ensembles may be differ¬ 
ent nsiEi]. Each nucleosome occupies W = 147bp on 
the DNA, such that if its leftmost position is bound 
to a site i another nucleosome cannot bind with its 
leftmost position to any of the sites in the interval 
[i — W + l,i + W — 1]. Due to DNA sequence prefer¬ 
ences of nucleosomes or any other reason a nucleosome 


bound with its leftmost position to site i possesses an 
energy Ei. In addition to this energy there is an interac¬ 
tion energy between two neighboring nucleosomes. Given 
the distance r > 0 between the leftmost positions of the 
two nucleosomes the interaction energy is given by V (r). 
The hardcore interaction is realized by V{r) = oo for 
0 < r < W. To obtain the equilibrium properties of the 
nucleosomes we numerically solve the recursive equation 
for the partition function inSJ 155] . 

Our focus is the following question: what should be 
the properties of the signal in the one-nucleosome energy 
profile along the DNA, Ei and the interaction between 
the nucleosomes, V{r), to achieve good positioning of 
nucleosomes on the DNA? In the next Section we define 
this question in more quantitative terms. 


III. QUANTITIES OF INTEREST 

In this paper we focus on several quantities which re¬ 
flect positioning of nucleosomes. Each one of them can 
be derived from an equilibrium probability of the site i 
to be covered by the leftmost position of a nucleosome, 
rii (start site probabilities). The average number of nu¬ 
cleosomes, N, is given by 

L 

N = Y.n.. ( 1 ) 

i=l 

We also define an ordered vector of occupancies, n'^, 
such that n° is the occupancy of the most occupied site 
(site with the highest value of n^), ri§ is the occupancy 
of the second-most occupied site etc. 

For cases when it is not important how a base-pair 
along the DNA is covered by a nucleosome (by which part 
of the nucleosome it is covered) the occupancy function 


w-i 

p^= 

i=o 

is of an interest. We define the average coverage of the 
DNA by 


P = 


1 

L 


L 




NW 

L 


( 3 ) 


There are different ways to define a measure of how 
well nucleosomes are positioned along the DNA. In this 
paper we use a very simple one: given that there are 
N nucleosomes, we define as V the fraction of nucleo¬ 
somes which occupies N most occupied locations along 
the DNA. Namely, 


r = 


N 

m—1 

E 


1 


( 4 ) 
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Figure 1: Illustration of how interactions between nucleosomes improves their positioning along the DNA. In this toy example 
"nucleosomes" are merely particles of Ibp size with disordered binding energy landscape on 400bp-long, circular "DNA". (a) 
Binding energies of a single nucleosome are i.i.d random variables with a Gaussian distribution with the standard deviation of 
a = l.SfcBr. (b) Binding energies of a single nucleosome are normally distributed with the standard deviation of a and are 
correlated, such that {EiEi+r) = cr^e”^, with rc = 20. (c) The probability that the site i is occupied by a nucleosomes, Ui 
is plotted vs. i for 10 non-interacting nucleosomes located on the binding energy profile from (a), (d) The same as (c) but for 
the binding energy profile from (b). (e) The same as (c) but for nucleosomes with strong interactions, such that the distance 
between two neighboring nucleosomes is constrained to 15bp. (f) The same as (e) but for the binding energy profile from (b). 


In the case of a single nucleosome, iV = 1 this definition 
becomes simply the occupation probability of the ground 
state—the order parameter of, say, the Random Energy 
model m- For multiple nucleosomes V can be viewed as 
a fraction of well positioned nucleosomes. In the case of 
a perfect positioning V — 1, while in case of positioning 
whatsoever, V — N/L 1. The last quantity, N/L, can 
be at most XjW for p = 1. 

As we show below, for correlated energy landscapes, in 
some cases nucleosomes are not positioned well on a sin¬ 
gle bp length scale but are positioned well within a few 
basepairs. In this case the value of V does not charac¬ 


terize fully the positioning goodness. For those case we 
exploit the following generalization of V. Denoting the 
profile around the m’th largest values of rii, as u^(s) we 
define for odd values of k 

N (fe-i)/2 

= E ( 5 ) 

m=l s=—(fc —1)/2 

The value of Vk is the measure of positioning given that 
one does not care about fuzziness on the lengthscale of k. 
One can easily see that on the level of one bp resolution 
Vi is given by V. However, as we show below, on corre- 
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lated energy landscapes and/or with interaction potential 
with wide wells the function Vk can be much more infor¬ 
mative than its single-bp resolution value Vi — V. We 
turn now to the consideration of positioning for different 
scenarios, starting from the simplest one. 


IV. POSITIONING OF ONE NUCLEOSOME 

It is instructive to consider first the simplest case of 
a single nucleosome, TV = 1, on a DNA of a certain 
length, L. To position it on the DNA in equilibrium 
one should generate non-uniform energy profile along the 
DNA. Below we discuss possible energy profiles which can 
be roughly divided to designed and generic ones. 


A. Designed energy landscape 


Conceptually, the easiest way to position a nucleosome 
on a site j is to design an energy landscape such that up 
to an additive constant Ei^j = —E and = 0. In 

this designed DNA case the positioning measure Q is 
given by (we measure all energies in units of k^^T) 


V = nl 


1 

1 -f Le-^ ’ 


( 6 ) 


such that the nucleosome occupies site i with probability 
of order one if if > InL. Therefore, having in the arsenal 
only energies of the order of a few k^T one can position 
a single nucleosome only on short sequences of tens— 
hundreds base-pairs, even on the best possible energy 
landscape. 


B. Disordered energy landscape with Gaussian 
distribution 


In the more generic case of a disordered energy land¬ 
scape the problem can be mapped to the Random Energy 
model |87flin] . In this case the probability that the left¬ 
most location of the nucleosome is located on site i along 
the DNA is given by 


Ui = 



where the partition function is 


( 7 ) 


Z = (8) 

i=l 


Therefore, to position a nucleosome on site i one has to 
fulfill the condition ~ Z. Consider the case where 
the energies {E^} are a set of i.i.d random variables with a 
normal probability distribution with standard deviation 
a: 


e 2 ^ 


In this case the lowest energy, E°, is well approximated 

by 



Pr (A) dE 


1 

L 


( 10 ) 


The solution is given by 



~ —aV2 In L. 


( 11 ) 


In the limit of zero temperature (or, equivalently infi¬ 
nite disorder strength a) the nucleosome will occupy the 
state with the lowest energy (ground state) with proba¬ 
bility 1. However, for small non-zero temperatures non¬ 
ground states will be partly occupied, such that the occu¬ 
pation probability of the ground state is smaller than one. 
Consider the m-lowest energy (1-lowest energy means the 
lowest one, 2-lowest energy is the second lowest energy, 
etc.) on the DNA, £!(/. Its value can be well approxi¬ 
mated by 


l^'^PTiE)dE=j ( 12 ) 

The solution is given by 



where the freezing disorder strength is given by 

(Tf ~ —v^erf”^ -1^ — ^2InL. (14) 

The affinities of the m-lowest state are given by 

A? , (15) 


In the low temperature (high disorder) limit, cr > erf the 
occupation of the lowest state is given by 


r = n° 






(16) 


where /'(s) = X)m=i ^be Riemann zeta function. 

Above the freezing point cr < tJf the sum in the equation 
above diverges. In this case the annealed approximation 
of the free energy is valid and the partition function is 
not widely distributed around its mean value 


(Z) =Le'"'/^. (17) 


In this regime the probability of occupation of any site is 
given by 


r = n° = 


■,-E° 


z 


g(T\/2 In L 


Pr(L0 


( 9 ) 


Le-V2 ’ 


(18) 
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such that it vanishes in the thermodynamic limit, L 
oo. In genomes of lengths in the range L = 10® — 10® 
bp the freezing transition happens in the range Uf ~ 
\/2 InL = 5.3- 6.4/fcBT. 

In sum positioning of a single nucleosome is determined 
by the disorder strength. It is well positioned on DNA 
of length L (such that 7^ ~ 1) with energetic Gaussian, 
uncorrelated disorder with width a for a ;g> •\/21nL and 
is "smeared" along the DNA (such that 7^ 1) in the 

opposite limit of weak disorder, a ^2 In L. In Fig. 
the above considerations are illustrated. In Appendix 
|A l| we discuss positioning on energy landscape with non- 
Gaussian distributions. Now we turn to discuss position¬ 
ing of many nucleosomes on the DNA. 



Figure 2: Positioning of a single nucleosome on the DNA with 
length L — lO^bp, such that the freezing transition is at at ~ 
5.3kBT. Occupancy of the deepest energy well is plotted vs. 
disorder width. The dots represent numerical simulation— 
median of 100 realizations of the disordered energy landscape 
energy. The lines represent the analytic solution: Eq. (161 for 
CT > (Tf and Eq. (181 for a < at. Inset: the same plot in the 
linear scale. 


V. POSITIONING OF MULTIPLE 
NUCLEOSOMES WITH ONLY HARDCORE 
INTERACTIONS 


Here we analyze positioning of TV 3> 1 nucleosomes 
with only hard-core interactions. The study of parti¬ 
cles with hard-core repulsions has a long history and is 
relevant to many applications. In the context of protein- 
DNA binding such a repulsion between proteins leads to 
crowding and influences the binding properties 
Here we consider positioning of nucleosomes of a finite 
size, lU > 1 |451 H5] and focus on positioning properties 
on different energy landscapes. 


A. Designed energy landscape 


Consider first a designed case when there are N nucle¬ 
osomes of size W on a DNA of length L and N energy 
wells of energy —E while the rest of the DNA positions 
have zero energy. For the best positioning all the dis¬ 
tances have to be larger or equal to W, such that nu¬ 
cleosomes don’t have to overlap to occupy all the energy 
wells. In this case to position well the nucleosomes one 
needs E > In Then, having, say, 10 — 70 — 90% cov¬ 
erage of the DNA by nucleosomes of length W = 147, 
to position the nucleosomes one needs energy well to be 
deeper than 7 — 4 — SfcsT. Moreover, even if the wells are 
that deep but the number of nucleosomes differs from the 
number of energy wells the positioning is getting worse. 
To make the positioning more robust one can make the 
distance between the wells being random. However, do¬ 
ing this one has to keep the minimal distance between two 
neighboring wells to be FF. Otherwise, the nucleosomes, 
being not able to overlap, spread more on the DNA de¬ 
creasing the positioning parameter, V (see Fig. [^. In 
a more generic, disordered case the positioning is more 
problematic. We turn now to discuss this case. 



Figure 3: Numerical results for positioning of nucleosomes 
with only hard-core interactions with W = 1476p on designed 
energy landscapes of a length of L = 10^ x W. Positioning 
parameter is plotted as a function of the coverage fraction 
p = NW/L. The energy profile is designed such that neigh¬ 
boring energy wells are separated by a distances with the 
following properties: Each distance is W + 63bp (circles), any 
distance is a sum of W and a number drawn from a geometric 
distribution with an average of 636p (squares) and each dis¬ 
tance is a number drawn from a geometric distribution with 
an average of W + 63bp (diamonds). The depths of the energy 
wells relative to the rest positions on the DNA are: (a) 2, (b) 
5 and (c) 8 ksT. The lines are to guide the eye. 
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B. Disordered energy landscape with Gaussian 
distribution 


VI. POSITIONING OF STRONGLY 
INTERACTING NUCLEOSOMES 


Consider positioning of N nucleosomes on uncorre¬ 
lated disordered energy profile normally distributed with 
standard deviation a (see Eq. (§)• In the regime a <C 

^2 In ^ the nucleosomes are poorly positioned, while in 
the opposite regime, 


a- > 




(19) 


the positioning is good. In sum, having, say, 10—70—90% 
coverage of the DNA by nucleosomes of length W = 147, 
to position the nucleosomes one needs disorder strength, 
a to be larger than 3.8 — 2.9 — 2AkBT. 

The derived requirement for positioning may sound 
weak. However, in fact it means that, say, for p = 70% 
and a = bk^T (moderate positioning regime, V ~ 0.6, 
as shown in Fig. the typical energy well for a nucle- 
osome is 14 ± 2^3^ deep (see Eq. 0 with L replaced 
by L/N), relative to a random DNA sequence. In sum, 
one can see that without interactions the energy varia¬ 
tions required for a good positioning seem to be above 
the ones measured in experiments [iiHin]- In Appendix 
|A 2| we discuss positioning of nucleosomes with only hard¬ 
core interaction on energy landscape with non-Gaussian 
distributions and show that the results in this Section do 
not change qualitatively in this case. In the next section 
we show how interactions between nucleosomes allow to 
position them with much weaker energetic disorder along 


Consider N nucleosomes on a DNA of length L. As 
we show above, weak sequence specificity cannot posi¬ 
tion nucleosomes. In this Section we analyze positioning 
of interaction nucleosomes on designed and disordered 
energy landscapes. 

Interaction between neighboring nucleosomes, were 
suggested before as one of the driving forces, ordering 
nucleosomes [27H291I47]. Here, we consider for simplic¬ 
ity the minimal model of an interaction with an energy 
well when two neighboring nucleosomes are at a distance 
R — A<r<R + A from each other. Namely, the inter¬ 
action potential is of the form 


.-V{r 


0 r<W 

= <,e-’ = n R + A>r>R-A>W (20) 
1 else. 


For simplicity we analyze narrow interaction energy well 
of only one bp, A = 0. Further we discuss other possible 
potential functions in general and, in particular, impor¬ 
tance of a finite width of the interaction potential well, 
A > 0. 


OO 








Figure 4: Positioning of nucleosomes with only hard-core in¬ 
teractions. Entropy of the DNA with length L — 10® x W 
and W = 147 as a function of the coverage fraction p = 
NW/L. The markers represent numerical simulation for 
a = 2 (squares), 5 (circles), 8 (diamonds) and 11 (dots) k^T. 


For strong enough interactions strength (large values 
of k) the nucleosomes gather to clusters, such that in 
each cluster the distance between the neighboring nu¬ 
cleosomes is R. As we show below this clustering effect 
can significantly improve positioning of nucleosomes. We 
demonstrate it first on a designed energy landscape. 














7 


A. Designed energy landscape 


B. Disordered energy landscape with Gaussian 
distribution 


In order to exploit interactions between nucleosomes 
and position them on a weak but designed energy land¬ 
scape one should have a spatial resonance between the en¬ 
ergy wells distance and the preferable distance between 
neighboring nucleosomes, R. Consider first the case of 
periodic array of wells with energy —E, such that the 
affinity is AT = s'®, and set the number of nucleosomes 
to be the number of wells. Due to interactions nucleo¬ 
somes locally crystallize to ordered arrays with nearest- 
neighbors distance of R. Denoting the average length of 
a cluster by M, one gets the condition for the positioning 
of the cluster (and, therefore, all the nucleosomes in the 
cluster): 


ME > In 


L 

N/M 


or 


L 

^ ^ TV' 


( 21 ) 


( 22 ) 


The average number of nucleosomes in a cluster, M, is 
given by 


M = - -^ 

1 - Pr(r = R) ’ 


(23) 


where Pr(r = R) is the probability that the distance be¬ 
tween two neighboring nucleosomes is R. These quanti¬ 
ties can be calculated using the self-consistency equation: 


, . 1 Kk^ 

Pr(r = i?) = 1-= - 


N/M 


(24) 


The solution is given by 


M = 




< 1 


(25) 


In the case when the energy along the DNA is a set 
of independent normally distributed variables with stan¬ 
dard deviation a (see Eq. (©), local crystallization of 
interacting nucleosomes also plays a major role in posi¬ 
tioning for small values of ct. As shown in Section [VB[ 
with only hard-core interactions the positioning is possi¬ 
ble only when condition (191 holds. Here we discuss the 
opposite limit of weak disorder and show that interac¬ 
tions between the nucleosomes can position nucleosomes 
even in this case. Consider a cluster of crystallized nucle¬ 
osomes. The effective energy landscape for such a cluster 
possesses a stronger disorder than for an individual nucle- 
osome. Namely, for a cluster of, say, m nucleosomes the 
standard deviation of cluster’s total energy distribution 
is y/ma, where ct, as before, is the energy standard devi¬ 
ation of a single nucleosome. However, for m 1, this 
effective energetic disorder possesses an approximate pe¬ 
riodicity of R because shifting the cluster by this length 
the total energy of the cluster does not change much. 
Nevertheless, local crystallization, increasing the effective 
disorder strength relative to the one for a single nucle¬ 
osome, causes the positioning of clusters and, therefore, 
positioning of individual nucleosomes. 

Consider a single typical cluster of a size M /$> 1. Typ¬ 
ical available space for it is given by while the typ¬ 
ical minimal energy is given by In R. Therefore, 

it will be frozen if 


'/Mct 3> \/2 In i? 


(26) 


and will be "smeared" on its available space in the oppo¬ 
site limit. 

The value of M can be estimated in the following way, 
using Eq. p3l: 


Pr(r = i?) = l-= — 

M K - 


(27) 


N/M 


Thus, 


Therefore, satisfying condition ( p^ , one gets strong 
improvement of the positioning {V ~ 1) even with very 
weak wells AT ~ 1. However, if the wells do not have 
a good periodicity or their period is different from the 
preferential distance between the nucleosomes the inter¬ 
actions do not improve the positioning. 

In sum, one can position strongly interacting nucleo¬ 
somes on a designed energy landscape with shallow en¬ 
ergy wells. However, this positioning is not robust to 
change of nucleosome coverage fraction and requires fine 
tuning of the distances between energy wells along the 
DNA. We turn now to discuss positioning on generic en¬ 
ergy landscapes where the positioning is not so strong 
but is more robust to properties of the energy landscape 
and the interaction potential. 


M = 


K L ^ T?> jY 


K 


(28) 


N 


Combining Eqs. (26) and (28), the required strength of 


interactions to position nucleosomes is given by 

4 W", 2 „ 

K » — In i?. 

CT^ p 


(29) 


In the case when ct ^ \/2 In i? condition (291 has to be 


replaced by At However, in this case condition 


(191 is satisfied, such that the positioning is possible with 
only hard-core interactions. Thus, as shown in Fig. if 
at least one of the conditions (191 and (p^ holds the 


positioning is good, such that 7^ ~ 1, while otherwise 

























7^ <C 1. One can see that strong interactions between 
nucleosomes are able to improve their positioning. In 
Appendix I A 3| we discuss positioning of strongly interact¬ 
ing nucleosomes on energy landscape with non-Gaussian 
distributions and show that the results in this Section do 
not change qualitatively in this case. 

The described local clustering of nucleosomes not only 
improves positioning of nucleosomes but also has another 
consequences—large length scale fluctuations of occu¬ 
pancy. We turn now to discuss this aspect of interactions- 
assisted positioning of nucleosomes. 



Figure 6: Positioning of nucleosomes with interactions on 
Gaussian energy landscape. Positioning parameter, V for the 
DNA with length L — 10"^ x W and W = 147 for the coverage 
fraction p = NW/L = 80% is plotted vs. disorder strength 
(left axis) and interaction strength (bottom axis) with prefer¬ 
able distance of R = 148. On the top one can see the av¬ 
erage size of the crystallized cluster of nucleosomes, derived 
from Eq. (28l. On the right the typical binding energy of a 
nucleosome (relative to the average energy) is shown. The 
lines represent the analytic conditions for a good positioning, 
Eqs. (|19|l (dotted line) and (291 (solid line). 


C. Large-scale fluctuations of occupancy 

Apart from the positioning of nucleosomes on 
small length-scale, there is another feature that is 
highly influenced by interactions between neighboring 
nucleosomes— the large-scale fluctuations of occupancy 
of nucleosomes. One can also interpret this effect as 
long length scale positioning. Without interactions oc¬ 
cupancy Pi averaged over thousands of base-pairs is not 
expected to deviate significantly from its average value, 
p. However, if neighboring nucleosomes possess a pref¬ 
erential distance, R which is smaller than the average 
linker length, W/p, and the nucleosomes are very well 
positioned one gets long DNA regions which are enriched 
by nucleosomes and, therefore, regions depleted with nu¬ 


cleosomes. The correlation length of the occupancy on 
enriched regions for strongly interacting and very well 
positioned nucleosomes is given (using Eq. (28)) by 


RM 


R 



(30) 


On this length scale the nucleosomes are enriched and 
their mean occupancy is given by W/R. 

The distance between the clusters of crystallized nu¬ 
cleosomes (with highly depleted occupancy) scales as 


- RM=(—-R\.f^. (31) 

N/M \p j y W ^ ’ 

In sum, strongly interacting and well positioned nucle¬ 
osomes are expected to exhibit highly fluctuating occu¬ 
pancy on large length-scales. In Section |VIII| we discuss 
how the described above considerations are relevant for 
more realistic energy landscapes and existing experimen¬ 
tal data, but before that we consider effects of another 
important feature of real systems—autocorrelation of the 
energy landscape. 


VII. ENERGY LANDSCAPE WITH 
CORRELATIONS 


So far we discussed random, non-correlated energy 
landscapes. However, DNA sequence possesses correla¬ 
tions |48]. Moreover, even on a random DNA sequence 
an energy landscape is expected to be correlated for dis¬ 
tances smaller than 147 because small shifts of nucleo¬ 
somes along the DNA does not change completely the 
sequence covered by the nucleosome. This is why, as we 
discuss in the next Section, real energy landscape are ex¬ 
pected to possess certain autocorrelation. In this Section 
we discuss how the autocorrelation of the energy land¬ 
scape affects positioning of nucleosomes. 

We analyze the following scenario in this Section. The 
energy landscape is assumed to be Gaussian (see Eq. 
([^) with an exponentially decaying autocorrelation, such 
that 


{EiEi^r) 


(32) 


Here, rc 1 is the correlation distance, such that for 
distances much larger than Tc the energies are nor corre¬ 
lated, while for distance much smaller than Tc the varia¬ 
tion of energy is much smaller than a. This model can be 
mapped to the generalized Random Energy model |49| . 
The condition for a good positioning on the single/few 
bp resolution, we derive below, correspond to the low¬ 
est /high temperature phase transition of that model, re¬ 
spectively. 

In this Section we assume that for Tc = 0 the nucleo¬ 
somes are well positioned on the DNA. The correlation is 
an additional trouble for positioning and here we derive 
an additional condition for a good positioning in presence 
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of the autocorrelation, on top of the conditions ( 19|29 1, 
for the non-correlated energy landscapes. We start from 
the simplest single-nucleosome case. 

Consider a single nucleosome on DNA of length L 
with an Gaussian energy landscape with standard devi¬ 
ation a and exponential autocorrelation with correlation 
length Conceptually we divide the DNA to Llr^ 

"boxes" of length Vc- In order to position the nucleo- 
somes in the box with the highest affinity one needs to 

satisfy a ^ ^/2In ^. This condition is satisfied because 


we assume here that without autocorrelation the posi¬ 
tioning is good and, therefore, a ^ InL. Thus, the 
problem is the positioning of the nucleosome within the 
box. 

Within the box all the energies are highly correlated. 
A way to generate such a correlated energy landscape is 
to set [KH] 


Ei+i — e Ei-\- ^ 1 — e ~^Gi, (33) 

where Gi is an uncorrelated set of Gaussian random vari¬ 
ables with standard deviation a. Thus, the standard de¬ 
viation of energies, E^, if i is in the range much smaller 
than Tc is given by With such a standard devia¬ 

tion, to position a nucleosome in a box of size Tc, one 
needs » J2 Inr^ or 

V2i-c 

a ^ \/ 4rc In r^. (34) 


This is a strong constrain on the positioning. Even for 
Tc = 5bp the positioning is bad unless a is much larger 
than 6kBT. 

Condition (34) remains the same also for the case of 
non-interacting nucleosomes or nucleosomes with only 
hard-core interactions. This is because (34) does not de¬ 
pend on the length of DNA per nucleosome but only on 
the correlation distance of the energy landscape. This 
makes positioning of non-interaction (or interacting with 
only hard-core repulsion) extremely problematic. In the 
next Section we show that the value of Vc is, at least, tens 
of base-pairs. For such a correlated energy landscape the 
positioning condition (34) is not expected to be satisfied. 

For interacting nucleosomes the standard deviation 
of the effective energy landscape is given by 
where M is the average number of nucleosomes in a crys¬ 
tallized cluster and given by Eq. (28). The positioning 
condition for strongly interacting nucleosomes is given by 


a » 



(35) 


imply that even for short-range autocorrelation of the 
energy profile with a realistic value of a it impossible to 
position properly nucleosome on a single bp resolution 
without strong interactions between them. 



Figure 7: Positioning of nucleosomes with interactions on 
Gaussian energy landscape with an exponentially decaying 
autocorrelation with correlation coefficient Vc = 20. Position¬ 
ing parameter, V for the DNA with length L — xW and 
W = 147 for the coverage fraction p = NW / L = 80% is plot¬ 
ted vs. disorder strength (left axis) and interaction strength 
(bottom axis) with preferable distance of 7? = 148. On the 
top one can see the average size of the crystallized cluster of 
nucleosomes, derived from Eq. (28l. On the right the typical 
binding energy of a nucleosome (relative to the average en¬ 
ergy) is shown. The line r epre sent s th e analytic conditions 
for a good positioning, Eq. (351 or (36l. 


However, the positioning on correlated energy land¬ 
scape is easier if one allow the nucleosome to be posi¬ 
tioned within a few bp. In order to see it we exploit the 
positioning function, defined in Eq. This function, 
Vk, characterizes the positioning within the resolution of 
k base-pairs. In Fig. one can see that in some cases 
even when the single bp resolution positioning is bad, 
7^ = 7^1 ^ 1, the positioning within k = 3, 5,... is signifi¬ 
cantly better. The condition for Vk to be of the order one 


is equivalent to condition (351 or (361 with rc replaced 


by r^/k. Namely, the condition for a good positioning 
within k bp is given by 


a 


4rc ^ 
Mk k 


(37) 


or, equivalently. 


W16r2 

K 3>-W In 

p 


(36) 


In Figs. [7] and [8] one can see the comparison of condition 




(38) 


One can see in Fig. [^th at condition ( [37| ) (or (381) can be 
much weaker than (|35[) (or (36l). In the next Section we 


(35) or (36) to numerical results. The obtained results study empirical landscape for which tr is roughly l-bk^T 
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and Tc is roughly lOObp. Thus, if the interaction between 
nucleosomes is strong enough to crystallize them to clus¬ 
ters of size M = 10 — 100 nucleosomes, one would expect 
to see bad positioning on the level of a single bp resolu¬ 
tion with Vi — 0.1 — 0.25 ((35l does not hold) but within 
fc = 9 bp the nucleosomes are positioned significantly 
better (([37]) does hold) with Vg = 0.5 - 0.8 (see Fig. j^. 
In the next Section we discuss positioning properties on 
empirical energy landscape and, in general, relevance of 
the above considerations to real systems. 



Figure 8: Positioning of nucleosomes with interactions on 
Gaussian energy landscape with an exponentially decaying 
autocorrelation with correlation coefficient — 100. Posi¬ 
tioning parameter, V for the DNA with length L = 10^ x W 
and W = 147 for the coverage fraction p = NW/L = 80% 
is plotted vs. disorder strength (left axis) and interaction 
strength (bottom axis) with preferable distance of 7? = 148. 
On the top one can see the average size of the crystallized 
cluster of nucleosomes, derived from Eq. (281. On the right 
the typical binding energy of a nucleosome (relative to the av¬ 
erage energy) is shown. The solid line represents the analytic 
conditions for a good positioning on a resolution of fc = Ibp, 
Eq. (jSSll or (36l. The dahsed line represents the analytic con¬ 


ditions for a good positioning on a resolution of fc = 9bp, 
Eq. (371 or (381. 


VIII. RELEVANCE TO EMPIRICAL RESULTS 

So far we discussed positioning on artificial energy 
landscapes and dissected the phase diagram to different 
regimes. An obvious question to ask now is: where is the 
real system on the phase diagram? In this Section we try 
to get insight into this. To do so, we calcnlate an energy 
landscape nsing a model in Ref. m- In fact, there are 
many different models for a binding energy of a nncleo- 
some to a given sequence (examples inclnde |51IlM] . for 
review see, e.g., |55[ [55] b We exploit only one of them, 
from Ref. m, because we are not interested in predict¬ 


ing locations of nucleosomes on some piece of DNA but in 
general properties of positioning of nucleosomes. In par- 
ticnlar, our goal in this section is to validate our results 
on artificial energy landscapes and show their relevance 
to more realistic energy landscapes. 

We start with contrasting the presented, artificial en¬ 
ergy landscapes and the one generated nsing the model 
in Ref. |51| on the genome of S. cerevisiae. First thing 
to note is that the distribution of the binding energies 
on the S. cerevisiae genome is close to, but deviates 
from a Gaussian (see Fig. 10). The standard deviation 
of the energy landscape is given by l-Qk-eT. Interest¬ 
ingly, as shown in Fig. the same model on a ran¬ 
domly shuffled S. cerevisiae genome yields significantly 
narrower energy landscape, well fitted by a Ganssian with 
a — 1.2Ak-QT. The energy landscape, generated by the 
model for Ref. |53| predicts even narrower energy land¬ 
scape (also shown in Fig. 10). 

Energy landscape with snch a narrow distribution even 
with no autocorrelation is not expected to position well 
the nucleosomes without strong interactions (see Eq. (191 
and Fig. j^. Interaction between nucleosomes can im¬ 
prove the positioning (see Eq. (29) and Fig. [^. 

However, as is expected, the calcnlated energy land¬ 
scape possesses certain autocorrelation. Some part of 
this autocorrelation is becanse shifting a nncleosome a 
few bp doesn’t change entirely the bonnd seqnence. This 
sort of autocorrelation, for distances smaller than 1476p 
exists even on a randomly shuffle genome (see Fig. 111. 
One can clearly see the periodic oscillations with an ap¬ 
proximate lObp period [111 I57| . 

On top of that, due to some sequence correlation along 
the real, non-shuffled, S. cerevisiae genome, the autocor¬ 
relation of binding energy persist even for distances larger 
than 1476p (see Fig. 11). One reason for snch a long-scale 
autocorrelation is that model in Ref. |HI] is biased by GG 
content |58] and GG content possess significant autocor¬ 
relation along the S. cerevisiae genome (see Fig. 12). 
The antocorrelation function clearly deviates from the 
simple exponential decay, which we assumed in our the¬ 
oretical considerations above. Ronghly, the correlation 
distance is close to lOObp, making the positioning mnch 
more problematic, relative to nncorrelated energy land¬ 
scape with the same standard deviation (see Fig. 13 a) 
vs. (b)). 

On snch an anto-correlated energy landscape, with 
only hardcore interactions, n = 0, nucleosomes are not 
well positioned on a single bp resolution (see a typical 
Ui profile in Fig. 12). Namely, the single bp positioning 
parameter, V = 0.06 (calculated on the first chromo¬ 
some of S. cerevisiae), is much smaller than 1. With 
only hard-core interactions the peaks in are not only 
low, bnt also wide. In fact the positioning fnnction Vk, 
shown in Fig, [l^a), demonstrates that the nucleosomes 
are "fuzzy" and poorly positioned even on the resolution 
of 105p. 

The absence of good positioning can be partially at¬ 
tributed to strong autocorrelation of energy, because the 
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Figure 9: Positioning of nucleosomes with interactions on Gaussian energy landscape with an exponentially decaying autocor¬ 
relation with correlation coefficient rc = 100. Positioning parameter, Pk for the DNA with length L = 10^ x W and W = 147 
for the coverage fraction p = NW/L = 80% is plotted vs. k for different disorder strengths (left axis) and interaction strengths 
(bottom axis) with preferable distance of i? = 148. The line represents the analytic conditions for a good positioning on a 
single bp resoltion, k — 1, Eq. (|35l) or (36l. 


distribution of uncorrelated Gaussian energy landscape 
with fj = l.bfceT' results in much better positioning, as 
is discnssed in Section |V B[ Indeed, calcnlation of posi¬ 
tioning of nucleosomes with only hard-core interaction 
on randomly shuffled energy landscape, calcnlated nsing 
model in Ref. ISU resnlts in narrow (Ibp) peaks of rii of 


an average height of = 7^ = 0.18 (see Fig. [I^b)). 

In contrast to the case with only hard-core interac¬ 
tions, the nncleosomes can be positioned mnch better 
in presence of interactions between neighboring nucleo¬ 
somes. For example, for the interaction strength of n = 9 
and the preferable distance R = 154 the peaks of rii 
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E 

Figure 10: Distribution of the binding energy of a nucleo- 
some on a S. cerevisiae (circles) and randomly shuffled S. 
cerevisiae genome (squares), calculated using the model in 
Ref. |51| . The standard deviation is equal to l.GkBT for S. 
cerevisiae and 1.24A;bT for randomly shuffled S. cerevisiae 
genome. The diamonds represent the distribution of the en¬ 
ergy landscape calculated in Ref. |53j for S. cerevisiae genome 
with a standard deviation of Q.^k^T. The lines are Gaussian 
fits with zero mean and respective standard deviations. 



Figure 11: Autocorrelation of the binding energy of a nu- 
cleosome on a S. cerevisiae (black solid line) and randomly 
shuffled S. cerevisiae genome (dotted, red line), calculated us¬ 
ing the model in Ref. |51| . The inset is a zoom in on the main 
plot. 


are much higher, such that the positioning parameter is 
Vi — 0.3 (see Figs. [I^ and [I^c)). The width of the peaks 
is 3 — 5bp, such that the positioning function is Vk ~ 0.45 
for fc > 3, as can be seen in Fig. @c). 

Can one position nucleosome with a more realistic in¬ 
teraction potential? A reasonable choice seems to be 


the one used in Ref. to fit qualitatively the lOn -|- 5 
(or, sometimes, 10.6n-|-8 m) periodicity found in many 
works, starting from Ref. m- In order to verify that our 
results do not depend qualitatively on the precise form 
of the interaction potential we used the same form as in 
Ref. [25, but with higher prefactor (12 instead of 5) and 
with a cutoff of 180bp for the computational purposes: 


V{r) = 


l2kBT cos 
0 


/ 2-7T{r-W) 
lObp 


r-W 
■ 50bp 


r <W 
VF < r < 180 
r > 180 


(39) 

In Fig. p^d) one can see that the interaction poten¬ 
tial in Eq. (|39[) is able to position nucleosomes on the 
energy landscape generated using the model in Ref. m 
within the resolution of 3 — 5bp. In the next Section 
m we analyze in more detail robustness and tunability 
of positioning to different properties of the interacting 
potential. 

In sum, these results indicate that good position¬ 
ing of nucleosomes is possible even on a realistic en¬ 
ergy landscapes with narrowly distributed (small a) 
and highly auto-correlated (large Vc) energies, provided 
strong enough interactions between them. 

Beyond the positioning of nucleosomes on small length 
scales, as we described in Section VI C| strong interac¬ 
tions between neighboring nucleosomes change DNA oc¬ 
cupancy by nucleosomes on large length scales. We turn 
now to discuss how the theoretical predictions in Section 
IVICI are relevant for realistic scenarios. 


A. Large-scale fluctuations of occupancy 


We start with nucleosomes with only hard-core inter¬ 
actions, as a reference case. In this case, since the energy 
profile is not well correlated for long distances, the oc¬ 
cupancy does not fluctuate significantly on large length 
scales (above a few nucleosome repeat lengths), as how 
in Figs. |T2| and pT| However, as discussed in Section 
I VIC I interactions locally crystallize nucleosomes, induc¬ 
ing large scale fluctuations in occupancy. In Figs. 
(zoom on first 7 ■ 10'^ bp of chromosome I of S. cerevisiae) 
and 14 (whole chromosome I of S. cerevisiae) one can 
see that interacting nucleosomes are distributed nonuni- 
formly along the DNA, in contrast to nucleosomes with 
only hard-core interactions. Interestingly, this sort of 
large length scale nonuniformity one also observes on the 
single-bp resolution data [^ (see Figs. [I^ and 141. More¬ 


over, the calculated occupancy seems to follows quite con¬ 
sistently the experimental one on different length scales. 
This is especially surprising because the data from [52] is 
based on chemical cleavage, while the model used by use 
to calculate energy profile along the DNA was derived 
based on MNase digestion m- 

These results indicates that the interactions between 
the nucleosomes help to position them not only on the 
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Figure 12: Analysis of the first 7 • 10"^6p of S. cerevisiae’s first chromosome. The lines on top of the figure depict exons. In 
panel number (I) the GC content is plotted. (II) Binding energy profile calculated using model from Ref. I51| . The additive 
constant term is such that the mean energy along the chromosome is zero. (II) Calculation of the nucleosome distribution for 
interacting nucleosomes with R = 154 and v = 9. The chemical potential is such that p = 0.8. The lines with dots represent 
Ui values, while the thick, red line represent occupancy level. The thin black line is the occupancy from the data in Ref. |62) . 
Both the occupancies are smoothed, such that only 100 lowest Fourier modes are presented. (Ill) The same as for the (II) 
panel but with only hard-core interactions, v = 0. 


short length-scales (a few bp), but also on the long 
length-scales, inducing large long-scale fluctuations in nu¬ 
cleosomes occupancy. In other words, strong interactions 
between the nucleosomes naturally yields long nucleo¬ 
somes diluted and enriched regions along the genome. 

We summarize this Section with the following con¬ 
clusions. On a realistic energy landscape nucleosomes 
are much better localized in presence of strong interac¬ 
tions. In this case the calculated results, predicting large 
length scale occupancy fluctuations, agree qualitatively 
and, surprisingly, quantitatively with the experimental 
data. We turn now to a more detailed analysis of how 
properties of the interaction potential affect distribution 
of nucleosomes along the genome. 


IX. TUNABILITY AND ROBUSTNESS OF THE 
POSITIONING 


Several relevant question are still to be answered. How 
robust are the obtained results for the made assump¬ 
tions? What if the interaction potential possesses a cer¬ 


tain width (A > 1 in Eq. (201)? How robust the positions 


of nucleosomes to change of parameters, like the strength 
of the interactions v, preferable distance, i?, number of 
wells, and the width of the interactions potential, A. 
Can one "tune" the distribution of nucleosomes along 
the DNA changing (locally or globally) the parameters 
of the interaction potential? 

We addressed some of these issues above, considering 
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Figure 13: Positioning goodness in different cases. In each panel the left plots are the average prohle of the N largest values of 
rii (insets are zoom out of the main plots). The right plots are the positioning function Vk vs. k. (a) Energy landscape from 
Ref. [51| with only hard-hore interactions between the nucleosomes, u = 0. (b) The same as in (a) but the energy landscape 
(not the sequence) is randomly shuffled, (c) The same as in (a) but nucleosomes interact with v = 9, R — 154, A = 0. (d) The 
same as in (a) but nucleosomes interact with the potential from Eq. ( |39[ ). (e) The same as in (c) but nucleosomes interact with 
A = 2. (f) The same as in (a) but nucleosomes interact with two wells potential, R = 154 and R = 164. 


more realistic potential between the nucleosomes, in Eq. 
(391, and found that the conclusions are quite robust to 
a particular form of the potential. In this Section we ad¬ 
dress these questions more systematically. We focus here 
on the energy profile from the previous Section, calcu¬ 
lated using model from Ref. m and take as a starting 
point potential with a single well with R = 154, v = 9, 
A = 0. In this case the positioning is reasonably good 
(see Figs. 12 and [T^c)). 

We start by changing the width of the interaction po¬ 
tential, A. Before we always (except from analyzing the 
potential form from Eq. (39l) assumed that the interac¬ 
tion potential is sharp to the level of a single bp, A = 0. 
Of course this assumption is not realistic and real effec¬ 
tive interaction potentials probably possess a finite width 
and more than one energy well |201 1551IM] . However, 
changing the width to a few base-pairs does not change 
qualitatively the results. Taking A = 2 the positioning 
remains good on the length scale of 2A + 1 = 5, such that 
the typical peak of rii has a height of 7^ ~ 0.3 and width 


of 5bp, as shown in Fig. [T3[e). As one can see in Fig. 15 


the occupancy on large length scales does not change 
much as one tune the value of A. Even on the level of 
a single bp resolution the Pearson correlation coefficient 
of Pi for a potential with a width of one bp (A = 0) and 
5bp (A = 2) is 0.87. The precise locations of nucleosomes 
do change, however. Looking on the N = pL/W largest 
values of rii for both values of A we observe an overlap 
of 18%. This value makes sense because we expect that, 
upon changing the width of the potential well from 1 bp 
to 5bp, 20% of the nucleosomes will remain in their posi¬ 
tions and 80% will move ±2 bp, within the new potential 
well. In sum, widening of the potential width does not 
change the distribution of nucleosomes on a large scale, 
but makes their position uncertain on the length scale of 
2A + 1. 


We add now one more well to the interaction potential 
at lObp from the first one, R = 154 + 10 = 164. The 
positioning gets slightly worse, such that the typical peak 
of Hi has a height of P ~ 0.2 and width of 3bp, as shown 
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Figure 14: Occupancy of nucleosomes on the global scale of the whole first chromosome of S. cerevisiae. Thin blue line represents 
calculation of the nucleosome distribution for interacting nucleosomes with R — 154 and v = 9. Thick, gray line is the occupancy 
from the data in Ref. |62| . The calculated occupancy for nucleosomes with only hard-core interactions is represented by the 
thin, red, almost flat line. The chemical potential set for calculations is such that p — 0.8. All the occupancies are smoothed, 
such that only 50 lowest Fourier modes are presented. 
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Figure 15: Changing the width of the potential, A. Thick red line represents calculation of the nucleosome distribution for 
interacting nucleosomes with R = 154, v = 9 and A = 0. Thin, blue line represents calculation of the nucleosome distribution 
for interacting nucleosomes with R = 154, v = 9 and A = 2, such that the width of the potential well is 2A -I- 1 = 5bp. The 
chemical potential set for calculations is such that p — 0.8. All the occupancies are smoothed, such that only 100 lowest Fourier 
modes are presented. Locally the occupancy values are also highly correlated. The Pearson correlation coefficient of pi (without 
any smoothing) for A = 0 and 2 is 0.87. The values of pi for A = 0 and 2 differ by 19% on average. However, the locations of 
nucleosomes for A = 0 and 2 signihcantly differ: N = pL/W locations with the highest values of for the two cases possess 
an overlap of only 18%. 
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in Fig. [^f). As one can see in Fig. |16| the occupancy on 
large length scales does not change much as one tune the 
value of A. Even on the level of a single bp resolution 
the Pearson correlation coefficient of pi for a potential 
with one and two wells is 0.75. The precise locations of 
nucleosomes do change, dramatically, however. Looking 
on the N = pL/W largest values of for both values of 
A we observe an overlap of only 3%. In sum, another well 
in the potential width does not change the distribution 
of nucleosomes on a large scale, but does change their 
positions on small length scale. 

The depth of the potential well does affect the goodness 
of positioning and has to be strong enough to have any ef¬ 
fect. However, once a reasonable positioning is achieved, 
its value does not change things much. As one can see 
in Fig. the occupancy on a large length scales does 
not change much as one tune the value of v from 9 to 
11. Even on the level of a single bp resolution the Pear¬ 
son correlation coefficient of for a potential with one 
and two wells is 0.94. Even the precise locations of nu¬ 
cleosomes do not change, dramatically. Looking on the 
N = pL/W largest values of rii for both values of v we 
observe an overlap of 67%. 

As one would expect, the position of the well, R, does 
not affect significantly the positioning properties and dis¬ 
tribution of the nucleosomes on a large scale. As one can 
see in Fig. the occupancy on large length scales does 
not change much as one tune the value of R from 154 to 
160. On the level of a single bp resolution the Pearson 
correlation coefficient of pi for a potential with one and 
two wells is 0.62. However, precise positioning of nucle¬ 
osomes is very sensitive to the value of i?: N = pL/W 
locations with the highest values of Ui for the two cases 
of R = 154 and i? = 160 possess an overlap of only 1%. 

We conclude that, in principle, a cell, tuning the prefer¬ 
able distance between nucleosomes, can control their dis¬ 
tribution along some part of DNA without changing sig¬ 
nificantly large scale properties, like an average position¬ 
ing goodness and large length scale occupancy. 


neighboring nucleosome we derive condition for a good 
positioning. We briefly summarize the conditions in the 
following paragraph. 

Assuming that neighboring nucleosomes possess a 
preferable distance with an affinity k 3> 1, relative to 
other distances, the number of locally crystallized nucle¬ 
osomes, M ^ 1, is given by Eq. (281, where N is the 


average number of nucleosomes and L is the length of 
the DNA. The the positioning is expected to be good 
on an uncorrelated Gaussian disorder with standard de¬ 


viation a is condition Eq. (261 holds. On a correlated 


energy landscape with a certain correlation distance, Tc, 
the positioning conditioning depends on the required res¬ 
olution. For a good positioning within k bp it is given by 
Eq. fSTl). 


Importantly, without strong interactions, k ~ 1, the 
conditions do not seem to hold for realistic parameters, 
indicating an important role of effective interactions be¬ 
tween the nucleosomes in their positioning. If the posi¬ 
tioning, as we suggest, is controlled by interactions, one 
expect to see long length-scale fluctuations of nucleosome 
occupancy. This conclusion agrees with empirical data on 
occupancy of nucleosomes. Moreover, the derived large 
length-scale occupancy profile, derived from an effective 
energy landscape and interaction potential which is suffi¬ 
cient to position the nucleosomes is similar to the empiri¬ 
cal one. We also analyze the robustness of positioning to 
parameters of the model. The parameters can vary with 
time and be different on different parts of the genome. 
In fact, as we demonstrate, tuning some parameters one 
can dramatically change the distribution of nucleosomes. 
In sum, our study emphasizes an important role of inter¬ 
action between the nucleosomes and indicates range of 
parameters needed for it. We expect this knowledge to 
be important for better understanding of organization of 
our epigenome. 


Appendix A: Non-Gaussian energy landscapes 


X. SUMMARY 

In this article we focus on goodness of positioning of 
nucleosomes on the DNA. We make several simplifying 
assumptions. We assume that we can take into account 
all the positioning factors by have an effective energy 
landscape and an effective interaction potential between 
neighboring nucleosomes. We ignore that nucleosomes 
can invade each others DNA territories. In addition we 
analyze only the equilibrium distribution of nucleosomes, 
ignoring very probable non-equilibrium aspects of nucle¬ 
osomes positioning. However, even within this simplified 
framework we clarify a few aspects of nucleosome posi¬ 
tioning, which do not seem to depend on these details. 

Looking on a generic energy landscape with some en¬ 
ergy distribution width, some energy typical autocorre¬ 
lation distance and some interaction potential between 


The distribution of the binding energies along the DNA 
does not have to be Gaussian. Here we analyze two other 
possible scenarios for the energy landscapes. 

1. Positioning of one nucleosome 

Here we discuss positioning of a single nucleosome on 
up-exponential and down-exponential energy landscapes. 

a. Disordered energy landscape with down-exponential 
distribution 

Gonsider scenario with what we denote as down- 
exponential distribution: 

Pr(ifi) = ; ~oo < Ei <0 ; £ > 0. (Al) 
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Figure 16: Changing the number of wells of the potential. Thick red line represents calculation of the nucleosome distribution for 
interacting nucleosomes with R = 154, v = 9 and A = 0. Thin, blue line represents calculation of the nucleosome distribution 
for interacting nucleosomes with two wells (both with A = 0) i? = 154 and R — 164, v = 9. The chemical potential set for 
calculations is such that p = 0.8. All the occupancies are smoothed, such that only 100 lowest Fourier modes are presented. 
Locally the occupancy values are also highly correlated. The Pearson correlation coefficient of pi (without any smoothing) 
for one and two wells Is 0.75. The values of pi for one and two wells differ by 18% on average. However, the locations of 
nucleosomes for one and two wells significantly differ: N = pL/W locations with the highest values of rii for the two cases 
possess an overlap of only 3%. 



Figure 17: Changing the strength of the potential, v. Thick red line represents calculation of the nucleosome distribution for 
interacting nucleosomes with R = 154, v = 9 and A = 0. Thin, blue line represents calculation of the nucleosome distribution 
for interacting nucleosomes with R = 154, u = 11 and A = 0. The chemical potential set for calculations is such that p = 0.8. 
All the occupancies are smoothed, such that only 100 lowest Fourier modes are presented. Locally the occupancy values are 
also highly correlated. The Pearson correlation coefficient of pi (without any smoothing) for v = 9 and 11 is 0.94. The values of 
pi for V = 9 and 11 differ by 15% on average. In this case even the locations of nucleosomes for u = 9 and 11 do not significantly 
differ: N = pL/W locations with the highest values of rii for the two cases possess an overlap of 67%. 


In this case the typical minimal energy can be estimated 
using Eq. (101 to be 


E° ~ -£\nL. 


(A2) 


The partition function be be estimated separately in two 
regimes (two phases in the thermodynamic limit): in the 
nonfrozen regime, £ <^ 1 the partition function is given 

by 


e"-® RT:{E)dE 


L 

l-£' 


The positioning parameter is this case is given by 


V ~ < 1 , 


(A4) 


such that the positionig is poor for £ <C 1. 


In the opposite regime £ ^ 1 the integral in Eq. (A3 \ 


diverges and the partition function is dominated by the 
deepest wells. Thus, it can be estimated using fc-minimal 
energies, given by 


L 


— oo 


(A3) 


E°k^£\n^. 


(A5) 
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i x10^ 

Figure 18: Changing the well location of the potential, R. Thick red line represents calculation of the nucleosome distribution 
for interacting nucleosomes with R = 154, u = 9 and A = 0. Thin, blue line represents calculation of the nucleosome distribution 
for interacting nucleosomes with R = 160, u = 9 and A = 0. The chemical potential set for calculations is such that p = 0.8. 
All the occupancies are smoothed, such that only 100 lowest Fourier modes are presented. Locally the occupancy values are 
also highly correlated. The Pearson correlation coefficient of pi (without any smoothing) for R = 154 and 160 is 0.62. The 
values of pi for R — 154 and 160 differ by 23% on average. On the resolution of a few bp the locations of nucleosomes for 
R = 154 and 160 are very different: N = pL/W locations with the highest values of n; for the two cases possess an overlap of 
only 1%. 


Therefore, 



(A6) 


such that the positioning parameter in this regime is 
given by 


r ~ 


1 

W)' 


and is larger than 1/2 and close to one for 
£ » C”^(2) 1.7. 


(A7) 


(A8) 


In sum, for down-exponential distribution of energies 
the only requirement for a good positioning is that the 
parameter £ is larger than one for any length of the DNA. 


b. Disordered energy landscape with up-exponential 
distribution 

Consider another scenario with what we denote as 
down-exponential distribution: 

Pr{Ei) = J 0 < Ai < oo ; £ > 0. (A9) 

In this case there are many energies which are close to 
zero, A/ ~ ^2 ~ Ag ~ ... ~ 0. Positiong is impossible 
for such energy landscape; for any value of £ one has 
P ^ 1. However, as we show below interaction between 
nucleosomes can induce reasonable positioning even on 
such a energy landscape. 


2. Positioning of multiple nucleosomes with only 
hardcore interactions 


Here we consider positioning of nucleosomes with 
hard-core interactions on up-exponential and down- 
exponential energy landscapes. 


a. Disordered energy landscape with down-exponential 
distribution 


Consider positioning of N nucleosomes on uncorrelated 
disordered energy profile down-exponentially distributed 
with some parameter £ (see Eq. (All). In the regime 
£ <C 1 the nucleosomes are poorly positioned, while in 
the opposite regime £ 3> 1, the positioning is good. 

The derived requirement for positioning may sound 
weak. However, in fact it means that, say, for p = 70% 
and £ = l.SfceT (moderate positioning regime, V ~ 0.6) 
the typical energy well for a nucleosome is 7.5 ± 2 A:bT 
deep (see Eq. (A2) with L replaced by L/N), relative to 
a random DNA sequence. 


b. Disordered energy landscape with up-exponential 
distribution 


Consider positioning of N nucleosomes on uncorrelated 
disordered energy profile up-exponentially distributed 
with some parameter £ (see Eq. (A9)). In this case, as 
for a single nucleosomes, the positioning is poor for any 
value of £. 
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3. Positioning of strongly interacting nucleosomes 

Here we discuss positioning of strongly interacting nu¬ 
cleosomes on up-exponential and down-exponential en¬ 
ergy landscapes. 


Disordered energy landscape with down-exponential 
distribution 


Disordered energy landscape with np-exponential 
distribution 


Consider positioning of N nucleosomes on uncorrelated 
disordered energy profile down-exponentially distributed 
with some parameter £ (see Eq. (All). In this case a 


cluster of TO 1 crystallized nucleosomes has a Gaussian 
energy landscape with a standard deviation of y/rn£. Us¬ 
ing the same arguments as for the Gaussian disorder one 
can derive two conditions for a good positioning. The 
first is for positioning of weakly interacting nucleosomes, 
such that M ~ 1. In this case the condition is given by 
Eq. dMl. 


If this condition is not satisfied one needs strongly in¬ 
teracting nucleosomes, such that 


or, using Eq. (28), 


VM£ > \/2Ini? 


4 T , 2 


(AlO) 


(All) 


Consider positioning of N nucleosomes on uncorrelated 
disordered energy profile up-exponentially distributed 
with some parameter £ (see Eq. (A9)). In this case, as 
for a single nucleosomes, the positioning is poor for any 
value of £ and one needs strongly interacting nucleosomes 
for a good positioning. 

If nucleosome strongly interact and form crystallized 
clusters of M ^ 1 nucleosomes on average (and this hap¬ 
pens when K ^ ^), the positioning condition is given by 


Eq. (All I. Thus the condition for positioning in this case 
is 


L , 4 L , 2 

AC»-and^-ln R. 


(A12) 
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lines represent the analytic conditions for a good positioning, 
Eqs. (A8l (dotted line) and ( All|| (solid line). 
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coverage fraction p = nWjL smaller than 80% is plotted vs. 
disorder strength (left axis) and interaction strength (bottom 
axis) with preferable distance oi R= 148. On the top one can 
see the average size of the crystallized cluster of nucleosomes, 
derived from Eq. (28l. On the right the typical energy peak 
(not well) is shown on the length of L/N. The line r epres ents 
the analytic conditions for a good positioning, Eq. (A12l. 
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